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DETAILED ACTION 

Response to Arguments 

1. Applicant's arguments, see Appeal Brief, filed 6 July 2009, with respect to the 
rejection(s) of claim(s) 28-34 under 35 U.S.C. 103(a) have been fully considered and 
are persuasive. Therefore, the rejection has been withdrawn. However, upon further 
consideration, a new ground(s) of rejection is made in view of Kochi et al. (User-defined 
Template for Identifying Document Type and Extracting Information from Documents). 
Accordingly, prosecution has been reopened. 

Additionally, Applicant did not address the rejections under 35 U.S.C. 112, 1st 
paragraph or the rejection of claims 1-20 under 35 U.S.C. 103(a) in the Appeal Brief 
filed 6 July 2009. Therefore, these rejections are maintained. 

Claim Rejections - 35 USC §112 

2. The following is a quotation of the first paragraph of 35 U.S.C. 1 12: 

The specification shall contain a written description of the invention, and of the manner and process of 
making and using it, in such full, clear, concise, and exact terms as to enable any person skilled in the 
art to which it pertains, or with which it is most nearly connected, to make and use the same and shall 
set forth the best mode contemplated by the inventor of carrying out his invention. 

3. Claims 1 and 11 are rejected under 35 U.S.C. 112, first paragraph, as failing to 
comply with the written description requirement. The claim(s) contains subject matter 
which was not described in the specification in such a way as to reasonably convey to 
one skilled in the relevant art that the inventor(s), at the time the application was filed, 
had possession of the claimed invention. 
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Claims 1 and 1 1 have been amended to recite converting individual portions to 
grayscale, applying an edge detection filter to the grayscale converted said individual 
portions, and thereafter individually rotating said individual portions to align with text 
respectively adjacent said individual portions. The specification describes the skew 
adjustment techniques on page 6, lines 6-16. The first technique identifies boundaries 
of a rectangular page and aligns the boundaries in an orthogonal orientation. Clearly, 
this technique rotates the entire image to correct for overall skew in the image. 
Similarly, a technique of converting the image to grayscale, and applying an edge 
detection filter to the image is described. However, in this technique, once the textual 
regions are identified, skew is modified by rotating the image (not by rotating the 
portions). This indicates that, again, the entire image is rotated to correct for skew. The 
specification does not provide support, therefore, for individually rotating individual 
portions of the text identified in the image to correct for skew. 

4. Claims 4 and 14 are rejected under 35 U.S.C. 112, first paragraph, as failing to 
comply with the written description requirement. The claim(s) contains subject matter 
which was not described in the specification in such a way as to reasonably convey to 
one skilled in the relevant art that the inventor(s), at the time the application was filed, 
had possession of the claimed invention. 

Claims 4 and 14 are directed to removing image blur resulting from portions of a 
surface being located outside a depth of field of an image capturing element. Pages 6-7 
of the originally filed specification describe the various image processing techniques 
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applied prior to performing OCR on the text of the document. Specifically, page 6, line 
17 to page 7, line 2 discusses distortions that arise from a surface (such a large bulge in 
a book). This section, however, only describes fixing warping that occurs from such a 
distortion (e.g. the letters nearer the bulge of the book will appear smaller and curvature 
of the line of text may occur). There is no discussion of blurring (i.e. as related to focus) 
in this section. Page 7, lines 7-13 describes a focusing technique to reduce blurring. 
However, there is no indication that this focusing correction differentiates areas of the 
surface that are blurry due to a depth of field. Rather, it simply states that blur is 
removed from "the image". 

Thus, specifically correcting focus from portions of a surface being located 
outside a depth of field is new matter not found in the originally filed disclosure. 

5. Claims 6-1 0 and 1 6-20 are rejected under 35 U.S.C. 1 1 2, first paragraph, as 
failing to comply with the enablement requirement. The claim(s) contains subject matter 
which was not described in the specification in such a way as to enable one skilled in 
the art to which it pertains, or with which it is most nearly connected, to make and/or use 
the invention. 

Claims 6-10 and 16-20 are directed to capturing a test image and a second 
image that differs from the test image to perform various corrections based on the two 
images. While the claims find support in the originally filed disclosure (i.e. the originally 
submitted claims) there is no description in the specification of taking a test image, let 
alone performing any processing based on a test image and a second image. Thus, 



Application/Control Number: 10/676,273 Page 5 

Art Unit: 2626 

any subject matter in the claims dealing with capturing and processing two different 
images is not enabled by the specification. 



Claim Rejections - 35 USC § 103 

6. The following is a quotation of 35 U.S.C. 1 03(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

7. Claims 1 and 2 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Myers et al. (U.S. Patent 7,171,046), in view of Myers et al. (7,031,553). 

In regard to claim 1 , Myers et al. ('046) disclose a system (Fig. 1 ) for the 
automated, audible recitation of text arranged in a sequence of one or more words and 
displayed on a surface area defining an area having a height dimension and a width 
dimension, said area displaying more than one character of said text along each 
dimension (text regions within a captured image, column 4, line 60 to column 5, line 6), 
said system comprising: 

a first element capable of distinguishing individual words in said sequence from 
an image of said surface (OCR processing recognizes words in the image, column 6, 
lines 24-35 and column 4, lines 6-7); 
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a second element capable of audibly reciting the words distinguished by said first 
element, in said sequence (text-to-speech synthesis is performed to audibly output the 
recognized words through a speaker, column 7, lines 60-63 and column 3, line 52); 

a third element capable of capturing an image of said surface such that all 
characters of said text within said area are captured simultaneously (a camera captures 
the image, thus capturing all the text simultaneously, column 4, lines 52-65 and column 
3, lines 52-54); and 

a fourth element capable of automatically processing said captured image so as 
to correct, without user interaction, for image distortion in a portion of said image 
resulting from capturing said image from a non-planar surface having at least a portion 
not parallel to that of an image sensor in said third element (corrective processing is 
applied to correct for imagery captured at an oblique angle, i.e. not parallel to that of the 
image sensor, column 5, lines 35-47), where said processing facilitates automated 
character recognition of text in a captured said image (OCR processing, column 6, lines 
24-35). 

Myers et al. ('046) further identify the processing to correct an image distortion 
resulting from the image having a non-planar surface is performed using the methods 
disclosed in Myers et al. ('553). However, Myers et al. ('046) do not specifically disclose 
the steps of: 

converting individual portions to grayscale; 

applying an edge detection filter to the grayscale-converted said individual 
portions; and 
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thereafter individually rotating said individual portions to align with text 
respectively adjacent said individual portions. 

Myers et al. ('553) disclose a method for correcting an image distortion resulting 
from the image having a non-planar surface (text at an angle relative to the image 
plane, column 8, lines 49-59), comprising: 

converting individual portions to grayscale (to detect text portions in the image, a 
grayscale version of the image is calculated, column 4, lines 1-10); 

applying an edge detection filter to the grayscale-converted said individual 
portions (an edge operator is applied to detect the boundaries of the text portions and 
produce rectangles around the text portions, column 4, lines 8-29); and 

thereafter individually rotating said individual portions to align with text 
respectively adjacent said individual portions (processing to adjust the text captured at 
an oblique angle is applied, column 4, lines 30-41; the corrective processing applied 
being computed independently for each line of text, column 9, lines 63-65). 

Thus, Myers et al. ('046) and Myers et al. ('553) disclose each claimed step, the 
only difference between the claimed invention and the prior art being the lack of actual 
combination of the elements in a single prior art reference. One of ordinary skill in the 
art could have combined Myers et al. ('046) and Myers et al. ('553) by following the 
teachings of Myers et al. ('046) to use the corrective techniques of Myers et al. ('553). 
One of ordinary skill in the art at the time of invention would have recognized that the 
resulting combination would predictably provide an image capturing device with the 
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ability to individually correct for image distortion resulting from portions of the image 
having a non-planar surface. 

Additionally, it would have been obvious to one of ordinary skill in the art at the 
time of invention to combine Myers et al. ('046) and Myers et al. ('553) because portable 
text recognition devices often capture text at an oblique angle, therefore correcting for 
such distortions would improve text detection and extraction, as suggested by Myers et 
al. ('553) (see column 1 , lines 49-60). 

In regard to claim 2, Myers et al. ('046) disclose said first element includes a 
programmable electronic dictionary (a lexicon for the OCR is automatically 
programmatically updated based on information context, column 6, line 59 to column 8, 
line 7). 

8. Claim 3 is rejected under 35 U.S.C. 103(a) as being unpatentable over Myers et 
al. (U.S. Patent 7,171,046), in view of Myers et al. (U.S. Patent 7,031,553), and further 
in view of Wang et al. (U.S. Patent 5,680,478). 

In regard to claim 3, Myers et al. ('046) and Myers et al. ('553) do not disclose 
said first element includes a spell checker. 

Wang et al. disclose a character recognition device that includes a spell checker 
(spell-checking is performed on OCR text, column 9, lines 57-64). 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to modify Myers et al. ('046) and Myers et al. ('553) to include a spell checker, 
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because this would increase reliability by ensuring that the recognized text formed real 
words. 

9. Claim 5 is rejected under 35 U.S.C. 103(a) as being unpatentable over Myers et 
al. (U.S. Patent 7,1 71 ,046), in view of Myers etal. (U.S. Patent 7,031,553), and further 
in view of Hiroe et al. (U.S. Patent 7,088,853). 

In regard to claim 5, Myers et al. ('046) and Myers et al. ('553) do not disclose 
said programmable electronic dictionary includes a phonetic module that automatically 
recites an estimated pronunciation of a word to a user for verification. 

Hiroe et al. disclose a device for pronouncing words recognized from an image 
(see Abstract), comprising phonetic module that automatically recites an estimated 
pronunciation of a word to a user for verification (a new word is presented to a camera, 
whereupon an estimated pronunciation is presented to the user to verify if it is correct, 
column 13, lines 27-41 and column 13, line 60 to column 14, line 23). 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to modify Myers et al. ('046) and Myers et al. ('553) to include a phonetic 
module that automatically recites an estimated pronunciation of a word to a user for 
verification, because this would allow new words not present in the dictionary to be 
recognized an pronounced correctly, as suggested by Hiroe et al. (column 6, line 67 to 
column 7, line 29). 
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10. Claims 4, 6-12, and 14, and 16-20 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Myers et al. (U.S. Patent 7,171 ,046), in view of Myers et al. (U.S. 
Patent 7,031 ,553), and further in view of Nakashima et al. (U.S. Patent 6,721 ,465). 

In regard to claim 4, Myers et al. ('046) and Myers et al. ('553) do not disclose 
said image distortion being correctable by said fourth element includes image blur 
resulting from portions of said surface being located outside a depth of field of said third 
element. 

Nakashima et al. disclose a portable image capture and correction device (see 
Fig. 20) that corrects image distortions resulting from portions of a surface being located 
outside a depth of field of a third element (a focal correction takes several readings, 
including those outside the depth of field, i.e. focused above the document, and 
performs correction to achieve a correct focus, by correcting diffusion, column 16, lines 
36-54 and column 17, lines 33-49). 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to modify Myers et al. ('046) and Myers et al. ('553) to correct image blur 
resulting from portions of a surface being located outside a depth of field of the third 
element, because correcting the blur resulting from portions of the surface being located 
outside the depth of field (a diffusion amount) improves the image quality, as taught by 
Nakashima et al., column 3, line 59 to column 4, line 1 1 ). 

In regard to claim 6, Myers et al. ('046) and Myers et al. ('553) do not disclose 
said third element includes a processor having software that instructs said third element 
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to capture a test image of at least a portion of said surface, analyze said test image, and 
based on said analysis, automatically, without user interaction, capture a second image 
that differs from said test image. 

Nakashima et al. disclose a portable image capture and correction device (see 
Fig. 20) that that instructs a third element to capture a test image of at least a portion of 
said surface, analyze said test image, and based on said analysis, automatically, 
without user interaction, capture a second image that differs from said test image (see 
Fig. 23, a first image is taken at step 3004, then the lens is set to a second focal 
position at step 3006, and a second image is taken at step 3006, column 18, lines 16- 
39). 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to modify Myers et al. ('046) and Myers et al. ('553) to capture a test image of 
at least a portion of said surface, analyze said test image, and based on said analysis, 
automatically, without user interaction, capture a second image that differs from said 
test image, because taking multiple images allows image correction to be performed 
that reduces the distortion of the image, as taught by Nakashima et al. (column 17, line 
66 to column 18, line 15). 

In regard to claim 7, Myers et al. ('046) and Myers et al. ('553) do not disclose 
said second image corrects for a skewed test image. 

Nakashima et al. disclose said second image corrects for a skewed test image 
(tilt is corrected, column 17, lines 54-64). 
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It would have been obvious to one of ordinary skill in the art at the time of 
invention to modify Myers et al. ('046) and Myers et al. ('553) to correct for a skewed 
test image based on said second image, because such processing reduces distortion in 
the image in a low cost manner, as taught by Nakashima et al. (column 17, line 66 to 
column 18, line 15). 

In regard to claim 8, Myers et al. ('046) and Myers et al. ('553) do not disclose 
said second image is more focused than said test image. 

Nakashima et al. disclose said second image is more focused than said test 
image (see Fig. 23, a first reading is taken at a first focus and a second reading is taken 
at a second focus, column 18, lines 16-39). 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to modify Myers et al. ('046) and Myers et al. ('553) to focus said second 
image more than said first image, because such processing reduces distortion in the 
image in a low cost manner, as taught by Nakashima et al. (column 17, line 66 to 
column 18, line 15). 

In regard to claim 9, Myers et al. ('046) and Myers et al. ('553) do not disclose 
said second image corrects for a distortion in the test image resulting from capturing 
text from a curved surface. 

Nakashima et al. disclose said second image corrects for a distortion in the test 
image resulting from capturing text from a curved surface (geometric deformation of a 
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document that was originally flat, i.e. curving of the surface, is corrected for, column 17, 
lines 54-64 and column 12, lines 54-57). 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to modify Myers et al. ('046) and Myers et al. ('553) to correct for a distortion in 
the test image resulting from capturing text from a curved surface, because such 
processing reduces distortion in the image in a low cost manner, as taught by 
Nakashima et al. (column 17, line 66 to column 18, line 15). 

In regard to claim 10, Myers et al. ('046) and Myers et al. ('553) do not disclose 
said second image is a portion of said first image. 

Nakashima et al. disclose said second image is a portion of said first image (i.e. 
the portion at a second focal length, see column 18, lines 16-38). 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to modify Myers et al. ('046) and Myers et al. ('553) so that said second image 
was a portion of said first image, because such processing reduces distortion in the 
image in a low cost manner, as taught by Nakashima et al. (column 17, line 66 to 
column 18, line 15). 

In regard to claim 1 1 , Myers et al. ('046) disclose a system (Fig. 1 ) for the 
automated, audible recitation of text arranged in a sequence of one or more words and 
displayed on a surface area defining an area having a height dimension and a width 
dimension, said area displaying more than one character of said text along each 
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dimension (text regions within a captured image, column 4, line 60 to column 5, line 6), 
said system comprising: 

a first element capable of distinguishing individual words in said sequence from 
an image of said surface (OCR processing recognizes words in the image, column 6, 
lines 24-35 and column 4, lines 6-7); 

a second element capable of audibly reciting the words distinguished by said first 
element, in said sequence (text-to-speech synthesis is performed to audibly output the 
recognized words through a speaker, column 7, lines 60-63 and column 3, line 52); 

a third element capable of capturing an image of said surface such that all 
characters of said text within said area are captured simultaneously (a camera captures 
the image, thus capturing all the text simultaneously, column 4, lines 52-65 and column 
3, lines 52-54); and 

a fourth element capable of automatically processing said captured image so as 
to correct, without user interaction, for image distortion in a portion of said image 
resulting from capturing said image from a non-planar surface having at least a portion 
not parallel to that of an image sensor in said third element (corrective processing is 
applied to correct for imagery captured at an oblique angle, i.e. not parallel to that of the 
image sensor, column 5, lines 35-47), where said processing facilitates automated 
character recognition of text in a captured said image (OCR processing, column 6, lines 
24-35). 

Myers et al. ('046) further identify the processing to correct an image distortion 
resulting from the image having a non-planar surface is performed using the methods 
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disclosed in Myers et al. ('553). However, Myers et al. ('046) do not specifically disclose 
the steps of: 

converting individual portions to grayscale; 

applying an edge detection filter to the grayscale-converted said individual 
portions; and 

thereafter individually rotating said individual portions to align with text 
respectively adjacent said individual portions. 

Myers et al. ('553) disclose a method for correcting an image distortion resulting 
from the image having a non-planar surface (text at an angle relative to the image 
plane, column 8, lines 49-59), comprising: 

converting individual portions to grayscale (to detect text portions in the image, a 
grayscale version of the image is calculated, column 4, lines 1-10); 

applying an edge detection filter to the grayscale-converted said individual 
portions (an edge operator is applied to detect the boundaries of the text portions and 
produce rectangles around the text portions, column 4, lines 8-29); and 

thereafter individually rotating said individual portions to align with text 
respectively adjacent said individual portions (processing to adjust the text captured at 
an oblique angle is applied, column 4, lines 30-41; the corrective processing applied 
being computed independently for each line of text, column 9, lines 63-65). 

Thus, Myers et al. ('046) and Myers et al. ('553) disclose each claimed step, the 
only difference between the claimed invention and the prior art being the lack of actual 
combination of the elements in a single prior art reference. One of ordinary skill in the 
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art could have combined Myers et al. ('046) and Myers et al. ('553) by following the 
teachings of Myers et al. ('046) to use the corrective techniques of Myers et al. ('553). 
One of ordinary skill in the art at the time of invention would have recognized that the 
resulting combination would predictably provide an image capturing device with the 
ability to individually correct for image distortion resulting from portions of the image 
having a non-planar surface. 

Additionally, it would have been obvious to one of ordinary skill in the art at the 
time of invention to combine Myers et al. ('046) and Myers et al. ('553) because portable 
text recognition devices often capture text at an oblique angle, therefore correcting for 
such distortions would improve text detection and extraction, as suggested by Myers et 
al. ('553) (see column 1 , lines 49-60). 

While Myers et al. ('046) and Myers et al. ('553) disclose said third element is a 
digital camera, Myers et al. ('046) and Myers et al. ('553) are silent as to the details of 
the image capturing portion. 

Nakashima et al. disclose a portable image capture and correction device (see 
Fig. 20) that includes a third element comprising: 

an array of light-sensitive members that each convert light incident on said 
members to respective electromagnetic signals (CCD sensor, column 17, lines 8-10); 

a lens capable of focusing an image on said array (lens, column 17, lines 8-10); 

and 
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a circuit capable of receiving said respective electromagnetic signals and 
creating an electronic image associated with said image (the CCD further creates the 
digital images, column 17, lines 8-10). 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to include a standard array of light sensitive members, a lens, and circuit to 
create a digital image in Myers et al. ('046) and Myers et al. ('553), because it would 
allow a quality image could be captured. 

In regard to claim 12, Myers et al. ('046) disclose said first element includes a 
programmable electronic dictionary (a lexicon for the OCR is automatically 
programmatically updated based on information context, column 6, line 59 to column 8, 
line 7). 

In regard to claim 14, Myers et al. ('046) and Myers et al. ('553) do not disclose 
said image distortion being correctable by said fourth element includes image blur 
resulting from portions of said surface being located outside a depth of field of said third 
element. 

Nakashima et al. disclose a portable image capture and correction device (see 
Fig. 20) that corrects image distortions resulting from portions of a surface being located 
outside a depth of field of a third element (a focal correction takes several readings, 
including those outside the depth of field, i.e. focused above the document, and 
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performs correction to achieve a correct focus, by correcting diffusion, column 16, lines 
36-54 and column 17, lines 33-49). 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to modify Myers et al. ('046) and Myers et al. ('553) to correct image blur 
resulting from portions of a surface being located outside a depth of field of the third 
element, because correcting the blur resulting from portions of the surface being located 
outside the depth of field (a diffusion amount) improves the image quality, as taught by 
Nakashima et al., column 3, line 59 to column 4, line 1 1 ). 

In regard to claim 16, Myers et al. ('046) and Myers et al. ('553) do not disclose 
said third element includes a processor having software that instructs said third element 
to capture a test image of at least a portion of said surface, analyze said test image, and 
based on said analysis, automatically, without user interaction, capture a second image 
that differs from said test image. 

Nakashima et al. disclose a portable image capture and correction device (see 
Fig. 20) that that instructs a third element to capture a test image of at least a portion of 
said surface, analyze said test image, and based on said analysis, automatically, 
without user interaction, capture a second image that differs from said test image (see 
Fig. 23, a first image is taken at step 3004, then the lens is set to a second focal 
position at step 3006, and a second image is taken at step 3006, column 18, lines 16- 
39). 
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It would have been obvious to one of ordinary skill in the art at the time of 
invention to modify Myers et al. ('046) and Myers et al. ('553) to capture a test image of 
at least a portion of said surface, analyze said test image, and based on said analysis, 
automatically, without user interaction, capture a second image that differs from said 
test image, because taking multiple images allows image correction to be performed 
that reduces the distortion of the image, as taught by Nakashima et al. (column 17, line 
66 to column 18, line 15). 

In regard to claim 17, Myers et al. ('046) and Myers et al. ('553) do not disclose 
said second image corrects for a skewed test image. 

Nakashima et al. disclose said second image corrects for a skewed test image 
(tilt is corrected, column 17, lines 54-64). 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to modify Myers et al. ('046) and Myers et al. ('553) to correct for a skewed 
test image based on said second image, because such processing reduces distortion in 
the image in a low cost manner, as taught by Nakashima et al. (column 17, line 66 to 
column 18, line 15). 

In regard to claim 18, Myers et al. ('046) and Myers et al. ('553) do not disclose 
said second image is more focused than said test image. 
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Nakashima et al. disclose said second image is more focused than said test 
image (see Fig. 23, a first reading is taken at a first focus and a second reading is taken 
at a second focus, column 18, lines 16-39). 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to modify Myers et al. ('046) and Myers et al. ('553) to focus said second 
image more than said first image, because such processing reduces distortion in the 
image in a low cost manner, as taught by Nakashima et al. (column 17, line 66 to 
column 18, line 15). 

In regard to claim 19, Myers et al. ('046) and Myers et al. ('553) do not disclose 
said second image corrects for a distortion in the test image resulting from capturing 
text from a curved surface. 

Nakashima et al. disclose said second image corrects for a distortion in the test 
image resulting from capturing text from a curved surface (geometric deformation of a 
document that was originally flat, i.e. curving of the surface, is corrected for, column 17, 
lines 54-64 and column 12, lines 54-57). 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to modify Myers et al. ('046) and Myers et al. ('553) to correct for a distortion in 
the test image resulting from capturing text from a curved surface, because such 
processing reduces distortion in the image in a low cost manner, as taught by 
Nakashima et al. (column 17, line 66 to column 18, line 15). 
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In regard to claim 20, Myers et al. ('046) and Myers et al. ('553) do not disclose 
said second image is a portion of said first image. 

Nakashima et al. disclose said second image is a portion of said first image (i.e. 
the portion at a second focal length, see column 18, lines 16-38). 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to modify Myers et al. ('046) and Myers et al. ('553) so that said second image 
was a portion of said first image, because such processing reduces distortion in the 
image in a low cost manner, as taught by Nakashima et al. (column 17, line 66 to 
column 18, line 15). 

1 1 . Claim 13 is rejected under 35 U.S.C. 103(a) as being unpatentable over Myers et 
al. (U.S. Patent 7,1 71 ,046), in view of Myers et al. (U.S. Patent 7,031 ,553), further in 
view of Nakashima et al., and further in view of Wang et al. 

In regard to claim 13, Myers et al. ('046), Myers et al. ('553) and Nakashima et al. 
do not disclose said first element includes a spell checker. 

Wang et al. disclose a character recognition device that includes a spell checker 
(spell-checking is performed on OCR text, column 9, lines 57-64). 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to modify Myers et al. ('046), Myers et al. ('553) and Nakashima et al. to 
include a spell checker, because this would increase reliability by ensuring that the 
recognized text formed real words. 
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12. Claim 15 is rejected under 35 U.S.C. 103(a) as being unpatentable over Myers et 
al. (U.S. Patent 7,171 ,046), in view of Myers et al. (U.S. Patent 7,031 ,553), further in 
view of Nakashima et al., and further in view of Hiroe et al. (U.S. Patent 7,088,853). 

In regard to claim 15, Myers et al. ('046), Myers et al. ('553) and Nakashima et al. 
do not disclose said programmable electronic dictionary includes a phonetic module that 
automatically recites an estimated pronunciation of a word to a user for verification. 

Hiroe et al. disclose a device for pronouncing words recognized from an image 
(see Abstract), comprising phonetic module that automatically recites an estimated 
pronunciation of a word to a user for verification (a new word is presented to a camera, 
whereupon an estimated pronunciation is presented to the user to verify if it is correct, 
column 13, lines 27-41 and column 13, line 60 to column 14, line 23). 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to modify Myers et al. ('046), Myers et al. ('553) and Nakashima et al. to 
include a phonetic module that automatically recites an estimated pronunciation of a 
word to a user for verification, because this would allow new words not present in the 
dictionary to be recognized an pronounced correctly, as suggested by Hiroe et al. 
(column 6, line 67 to column 7, line 29). 

1 3. Claims 28-31 are rejected under 35 U.S.C. 1 03(a) as being unpatentable over 
Nakashima et al., in view of Myers et al. (U.S. Patent 7,171 ,046). 

In regard to claim 28, Nakashima et al. disclose a cell phone (Fig. 32, pocket 
telephone 1925, column 21, lines 27-29), comprising: 
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a body portion containing a keypad (see Fig. 32), an audio receiver and an audio 
transmitter (a telephone necessarily includes a speaker and microphone for receiving 
and transmitting audio); 

a digital camera in said body portion having an outwardly facing lens (sensor 
1920, column 21, lines 29-32); and 

a processor capable of receiving an image containing a text sequence from said 
digital camera and distinguishing words in said sequence (see Figs. 36A-D, character 
recognition is performed on various areas, column 22, lines 49-60). 

Nakashima et al. do not disclose causing said audio transmitter to recite said 
individual words in said sequence. 

Myers et al. disclose an electronic device including an audio device capable of 
audibly reciting said words in said sequence (text-to-speech synthesis is performed to 
audibly output words recognized by OCR through a speaker, column 7, lines 60-63 and 
column 3, line 52). 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to modify Nakashima et al. to include an audio device capable of audibly 
reciting said words in said sequence, because outputting the recognized words audibly 
provides a plurality of applications, such as, e.g. aiding visually impaired individuals, as 
suggested by Myers et al. (column 2, lines 21-29). 

Nakashima et al. and Myers et al. do not disclose storage storing a plurality of 
templates for identifying the layout format of text in an image captured by said digital 
camera. 



Application/Control Number: 10/676,273 Page 24 

Art Unit: 2626 

Kochi et al. disclose a system (Fig. 2) comprising storage (model registration 
module) storing a plurality of templates for identifying the layout format of text in an 
image captured by said digital camera (a model identifying the layout format of a 
document is generated, section 2; so that the document layout format type can be 
determined during recognition, section 3). 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to further modify the combination of Nakashima et al. and Myers et al. to 
include storage storing a plurality of templates for identifying the layout format of text in 
an image captured by said digital camera, because including models that identify the 
layout format of the text of a captured image reduces the recognition error rate, as 
taught by Kochi et al. (section 4.2) 

In regard to claim 29, while Kochi et al. disclose saving templates for identifying 
the layout format of text in an image (for business letters, reports, technical papers, 
magazines, and Japanese articles with character strings aligned vertically, section 4.1), 
Nakashima et al., Myers et al., and Kochi et al. do not specifically disclose one of the 
templates is in the layout format of a menu. 

However, Myers et al. disclose menus are a common image that are captured for 
audible recitation in a portable electronic device (see column 4, lines 42-51, naming 
several common images that would be captured by a portable electronic device). 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to modify Nakashima et al., Myers et al., and Kochi et al. to include a menu 
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template in the plurality of templates, because this would ensure there was a template 
that would match a commonly applied application for the portable electronic device (i.e. 
reading menus). 

In regard to claim 30, Nakashima et al. disclose said processor is capable of 
correcting for at least one of a skew, blur, and distortion (tilt, diffusion, and geometric 
deformation, column 17, lines 44-64). 

In regard to claim 31 , Nakashima et al. disclose said processor includes a page 
prompt module that is capable of identifying a page number in the header or footer of an 
image, and prompting the audio device to recite a warning to a user if the apparatus 
receives images of pages of text in nonsequential order (a book that is continuously 
read is scanned at the upper or lower part of the image to detect page numbers, and if a 
page is skipped, prior to capturing the image, the user is warned audibly, column 19, 
line 61 to column 20, line 16). 

14. Claim 32 is rejected under 35 U.S.C. 103(a) as being unpatentable over 
Nakashima et al., in view of Myers et al. (U.S. Patent 7,171,046), in further view of 
Kochi et al., and further in view of Fujimoto et al. (U.S. Patent Application Publication 
2002/0031264). 

In regard to claim 32, while Kochi et al. disclose saving templates for identifying 
the layout format of text in an image (for business letters, reports, technical papers, 
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magazines, and Japanese articles with character strings aligned vertically, section 4.1), 
Nakashima et al., Myers et al., and Kochi et al. do not specifically disclose at least one 
of said templates is in the layout format of a newspaper. 

Fujimoto et al. disclose an electronic device for capturing images to be 
recognizes, wherein the layout format of the image is determined, including the layout 
format of a newspaper (see Fig. 1 , a newspaper image's layout is determined, page 3 
paragraph 60). 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to modify Nakashima et al., Myers et al., and Kochi et al. to include a template 
for a layout format of a newspaper, because determining the layout of newspaper text is 
advantageous for recognizing characters in the newspaper, as taught by Fujimoto et al. 
(page 1 , paragraph 3). 

15. Claim 33 is rejected under 35 U.S.C. 103(a) as being unpatentable over 
Nakashima et al., in view of Myers et al. (U.S. Patent 7,171,046), in further view of 
Kochi et al., and further in view of Piehn et al. (U.S. Patent Application Publication 
2001/0056342). 

In regard to claim 33, while Kochi et al. disclose saving templates for identifying 
the layout format of text in an image (for business letters, reports, technical papers, 
magazines, and Japanese articles with character strings aligned vertically, section 4.1), 
Nakashima et al., Myers et al., and Kochi et al. do not specifically disclose one of said 
templates corresponds to a phone book. 
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Piehn et al. disclose portable device for audibly reciting words recognized in an 
image (see Abstract), wherein phone books are a common image captured for audible 
recitation by the device (see page 3, paragraph 28, naming several common images 
that would be captured by a portable electronic device). 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to modify Nakashima et al., Myers et al., and Kochi et al. to include a phone 
book layout template, because this would ensure there was a template that would 
match a commonly applied application for the portable electronic device (i.e. reading 
phone books). 

16. Claim 34 is rejected under 35 U.S.C. 103(a) as being unpatentable over 
Nakashima et al., in view of Myers et al. (U.S. Patent 7,171,046), in further view of 
Kochi et al., in further view of Piehn et al., and further in view of Maes et al. (U.S. Patent 
7,092,496). 

In regard to claim 34, Nakashima et al., Myers et al., Kochi et al., and Piehn et al. 
do not disclose the cell phone includes a button and said one of said templates instructs 
said processor to dial the phone number of a phone book entry being recited when the 
user presses the button. 

Maes et al. disclose a cell phone that includes a button (Key for providing keyed- 
in command) that instructs a processor to dial the phone number of a phone book entry 
being recognized when the user presses said button (a user instructs the phone, via a 
keyed-in command, to visually recognize a phone number and automatically dial the 
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number that is recognized, see column 9, lines 59-67 and specific visual application at 
column 10, lines 48-62). 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to further modify Nakashima et al., Myers et al., Kochi et al., and Piehn et al. 
to instruct the processor to dial the phone number of a phone book entry being recited 
when the user pressed the button, because this would provide a user a convenient, 
"one-touch" method for capturing and dialing a phone number out of a phone book. 

Conclusion 

1 7. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to BRIAN L. ALBERTALLI whose telephone number is 
(571 )272-7616. The examiner can normally be reached on Monday-Thursday, 8 AM to 
6:30 PM. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, David Hudspeth can be reached on (571) 272-7843. The fax phone number 
for the organization where this application or proceeding is assigned is 571-273-8300. 
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Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 
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